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I.  INTRODUCTION 


The  problem  of  analysis  of  data  coming  from  "sensitivity  experiments" 
in  the  context  of  Ballistics  Research  is  classically  old.  Equally 
classical  is  the  problem  of  analysis  of  data  originating  from  the  statis¬ 
tically  analogous  and  the  conceptually  parallel  experiments  conducted  in 
biological  and  pharmaceutical  research  in  dosage  mortality  estimations. 

Such  estimation  procedure  has  even  a  wider  horizon,  as  the  same  methodology 
is  applied  also  to  some  reliability  studies  insofar  as  such  studies 
are  characterized  by  the  "stress-strength"  phenomenon.  In  all  these 
areas,  there  has  indeed  been  a  considerable  accumulation  of  data  which 
have  been  analyzed  from  time  to  time  by  various  methods  depending  upon 
the  types  and  designs  of  the  experiments  suited  to  different  models 
characterizing  the  distribution  of  the  response  function.  However,  one 
of  the  methods,  which  is  frequently  resorted  to,  is  the  use  of  the 
maximum  likelihood  estimation  procedure  in  the  set-up  of  the  cumulative 
normal  distribution  as  the  model  for  describing  the  probability  of 
"success"  (or  "failure")  varying  with  the  intensity  of  the  stimulus 
applied.  It  is  generally  believed  that  this  procedure  of  estimation  in 
the  framework  of  normal  response  is  still  one  of  the  best  available  so 
far.  Hence,  in  response  to  frequent  requests  for  analysis  of  such  data, 
the  BRL  ofjAberdeen  Proving  Ground  has  coded  the  solutions  (see  Golub 
and  Grubbs  )  providing  the  estimates  of  p  and  o  along  with  the  estimates 
of  their  asymptotic  variances. 

II.  THE  AIM  OF  THE  PRESENT  PAPER 

The  estimates  of  the  asymptotic  variances  referred  to  above  would 
no  doubt  provide,  in  themselves,  some  measure  of  efficiency  of  the 
estimates  of  p  and  a;  but,  it  appears  that  no  measure  has  been  made 
available  yet  that  would  show  how  efficient  these  estimates  are,  and 
to  what  extent  the  efficiency  could  be  further  improved.  There  is  also 
a  need  for  a  basis  to  measure  the  relative  efficiency  of  the  experiment 
as  a  whole,  the  calculation  of  which  would  be  based  upon  the  maximum 
possible  precision  attainable  under  such  an  estimation  procedure.  Such 
a  measure,  if  it  could  be  provided,  would  furnish  an  idea  of  the  relative 
efficiencies  of  similar  experiments  conducted  at  different  times,  pointing 
possibly  to  the  necessity  for  re-assessment  in  some  cases.  The  aim  of 
this  paper  is  to  provide  such  a  measure  of  efficiency.  The  calculation 
of  efficiency  as  presented  here  is  based  on  the  exposition  of  some 
simple  statistics.  Such  an  exposition  is  found  to  be  helpful  in  under¬ 
standing  more  visibly  some  of  the  well  known  implications  and  limi¬ 
tations  of  the  maximum  likelihood  estimation  procedure  in  the  given  context. 


Golub,  A.,  and  Grubbs,  F.E.,  "Analysis  of  sensitivity  experiments 
when  the  levels  of  stimulus  cannot  be  controlled,"  Ann.  Math.  Stat. 
57  (1956),  257-265. 


In  indicating  how  the  maximum  possible  precision  can  be  attained, 
we  also  aim  at  suggesting  a  good  design  for  such  an  experiment  in  the 
situation  where  the  levels  of  the  stimulus  could  be  reasonably  controlled. 

It  is  not  known  to  the  present  authors  if  anybody  else  has,  in  the 
past,  attempted  such  an  analysis  based  upon  the  exposition  of  the  simple 
statistics  as  presented  in  this  paper. 

II I.  MAXIMUM  LIKELIHOOD  ESTIMATION  PROCEDURE 


Estimating  Equations 


It  is  assumed  that  the  probability  p.  of  success  with  the  stimulus 

x.  is  of  the  form.  1 

1  ‘ 


t . 
1 


(1) 


where  p  and  a  are  the  unknown  parameters  which  require  to  be  estimated. 

We  then  let  to  be  a  random  variable  taking  the  value  1  or  0, 

depending  upon  whether  the  outcome  of  the  trial  is  a  success  or  a  failure 
at  the  level  x.^  with  the  probability  of  success  as  pi  =  Prob  (Y^  =  1), 

and  the  probability  of  failure  as  1  -  p.  =  Prob  (Y^  =  0). 

It  may  be  noted  here  that  success  and  failure  are  relative  terms  and 
that  we  can  interchange  the  definitions.  In  this  case  and  for  the  purpose 
of  discussion  in  this  paper,  we  assume  that  the  probability  of  success 
hecomes  larger  with  the  increased  level  of  the  stimulus. 

The  likelihood  equation  is  given  by 


L  =  n  p  yi  (l-p.)1-yi  ,  (2) 

i  =  l  1  1 

where  p^,  as  noted  in  (1),  is  a  function  of  p  and  o,  and  the  estimating 
equations  are  obtained  as; 
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For  a  given  set  of  data,  we  may  actually  substitute  the  values  of 
y^  as  1  or  0,  depending  upon  whether  the  given  stimulus  has  brought  a 

success  or  a  failure,  and  obtain  the  equations  in  the  following  forms. 
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where  the  subscripts  s  and  f  refer  respectively  to  success  and  failure. 

It  may  be  pointed  out  here  that  z^,  p^,  are  all  functions  of  t^. 

The  argument  t^  is  omitted  here  for  the  sake  of  notational  simplicity. 

It  will  be  introduced  whenever  necessary  to  show  the  argument. 

Since  the  equations  are  not  directly  soluble,  an  iteration  scheme 
as  embodied  in  the  Newton-Raphson  procedure  is  used  to  provide  the  estimates 
of  u  and  a.  For  the^details  of  calculation,  one  may  refer,  among  others, 
to  Golub  and  Grubbs. 

B.  Variances  of  the  Estimates 

The  asymptotic  variance-covariance  matrix  (see  Golub  and  Grubbs*) 
is  given  by 
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In  the  above,  'E'  stands  for  expectation  with  reference  to  Y^,  and 
Auli,  Ayo  and  a00  in  equation  (5)  denote,  respectively,  the  variance  of  y , 
the  covariance  between  u  and  c,  and  the  variance  of  a. 


It  is  easy  to  show  (see,  for  example,  Golub  and  Grubbs1)  that; 
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IV.  LIMITATIONS  AND  RESTRICTIONS  OWING  TO  SMALL  SAMPLE  SIZE 

The  efficiency  of  the  maximum  likelihood  estimation  procedure  would 
depend  upon  how  large  is  the  sample  size.  A  large  sample  would  provide 
unbiased  and  minimum  variance  estimates  of  y  and  o.  Such  a  claim  cannot 
be  made  in  general  when  the  number  of  trials  is  small.  Results  based  on 

2 

simulation  studies  from  small  samples  indicate  that  the  estimates  of  y 
are  unbiased  for  all  practical  purposes,  but  the  estimates  of  o  are  biased, 

being  too  small  on  the  average.  Variances  of  a  have  also  been  found  to  be 
rather  large.  Such  an  instability  of  the  estimate  of  o  has  also  an  adverse 
effect  on  the  efficiency  of  estimates  of  y  and  a,  as  such  estimates  are 
2 

proportional  to  o  (see  equations  (7.1)  -  (7.3)). 

More  importantly,  a  small  sample  may  not  even  be  amenable  to  this 
kind  of  analysis  in  some  situations.  Before  the  data  are  subjected  to 
such  an  analysis,  the  success-failure  sequence  has  to  be  examined.  In 


Visnaw,  V.  V.,  and  Hagan,  J.  S.,  "Analysis  of  sensitivity  data 
following  a  normal  distribution,"  Report  No.  70-AS-113,  Material 
Testing  Directorate,  APG,  Maryland,  1970. 


order  that  it  be  possible  to  estimate  both  p  and  c  with  oe(0,°°), 

there  should  be  a  zone  of  "mixed  results."  If  no  zone  of  "mixed  results" 

occurs,  no  estimate  of  o  other  than  0  can  be  obtained.  (See,  for  example, 

Langlie^).  (No  zone  of  "mixed  results"  means  that  the  highest  stimulus 
level  at  which  a  nonresponse  (in  this  case,  a  failure)  occurred  is 
less  than  the  lowest  stimulus  level  at  which  a  response  (in  this  case,  a 
success)  occurred.) 


V.  ON  THE  CONSISTENCY  OF  THE  EQUATIONS  PROVIDING  THE  ESTIMATES 
A.  Introductory  Remarks  Paving  the  Background 


In  consequence  of  having  to  deal  with  a  small  sample,  we  may  face  a 
situation,  as  referred  to  above,  where  it  may  not  be  possible  to  furnish 
any  maximum  likelihood  estimate  of  p  and  o  at  all,  under  the  assumption 
that  a  is  non-zero  and  finite.  In  particular,  we  need  a  zone  of  "mixed 

results,"  and  that  we  need  such^a  zone  is  demonstrated  in  Langlie. 

As  the  demonstration  in  Langlie  is  based  on  contrapositive  reasoning 
which  is  rather  involved,  one  may  be  interested  in  seeing  more  visibly 
how  a  zone  of  "no  mixed  results"  really  affects  the  consistency  of  the 
estimating  equations. 

It  has  been  observed  in  section  III  that  the  equations  providing  the 
estimates  of  p  and  a  and  the  variance-covariance  matrix  for  these  estimates 
depend  only  upon  z/q,  z/p  and  t.  Thus,  a  portrayal  of  z/q,  z/p  and  the 
difference,  (z/q-z/p) ,  should  provide  the  necessary  information  required 
in  checking  the  consistency  of  the  estimating  equations. 

Diagram  I  shows  the  graphs  of  z/q,  z/p  and  (z/q  -  z/p)  as  functions 
of  t.  Of  these  three  graphs,  we  need  the  first  two  for  the  purpose  of  our 
immediate  discussion.  The  third  graph,  that  is,  the  graph  of  (z/q  -  z/p) 
will  be  referred  to  in  a  later  section  (section  VII). 

Note  that  z(-t)  =  z(t),  p(-t)  =  q(t),  and  q(-t)  =  p(t)  with 

t  c  (-“,»).  Also,  in  the  positive  domain  of  t  with  t  e  [0,<»),  z/q  >  z/p, 

1/2 

q  <  p;  z(0)/p(0)  =  z(0)/q(0)  =  (2/it)  '  .  In  the  negative  domain  of  t, 
z/p  >  z/q,  as  p  takes  the  place  of  q. 


Langlie,  H.  J.,  "A  reliability  test  method  for  "one-shot"  items," 
Publication  No.  U-1792,  Reliability  Branch,  work  performed  under 
US  Army  contract  DA-04-495-ORD-1835,  1962. 


B.  Inevitable  Inconsistency  with  Two  Observations 

Given  that  o  is  nonzero  and  finite,  two  distinct  observations  would 
always  lead  to  inconsistency  in  the  estimating  equations.  Let  us  suppose 
that  both  the  observations,  given  by  t  =  t^  and  t  =  t2,  t^  <  t2,  are  in 

the  positive  domain  of  t,  and  that  one  gives  a  failure  and  the  other,  a 
success.  Then,  equation  (4.1)  would  reduce  to  zft^/qftj)  =  z(t2)/p(t2). 

This  can  never  happen  when  t^  and  t2  are  distinct.  If  both  tj  and  t2 

give  successes,  then  equation  (4.1)  would  reduce  to  z^(tj)/p(tj)  ♦ 

z(t2)/p(t2)  =  0.  Since  each  of  these  two  terms  is  positive,  and  only 

one  can  go  to  0  as  t  +  «,  this  can  never  happen.  Again,  if  the  success 
occurs  in  the  positive  domain  and  the  failure  in  the  negative  domain, 
then  equation  (4.1)  would  reduce  to  zf-tp/qf-t^)  =  z(t2)/p(t2)  which 

would  imply  that  zft^/pft^)  =  z(t2)/p(t2),  and  this  is  impossible,  when 
tj  and  t2  are  distinct.  However,  if  success  occurs  in  the  negative  do¬ 
main,  and  failure  in  the  positive  domain  with  t^  =  -t2>  then  equation 
(4.1)  would  be  consistent  being  given  by  z(-t2)/q(-t2)  =  z(t2)/q(t-,) 
z(t2)/q(t2)  =  z (t7)/q(t2) .  But,  equation  (4.2)  will  be  inconsistent. 
Similar  inconsistency  can  be  demonstrated  in  other  cases  also. 

C •  Other  Types  of  Inconsistencies 

If  there  are  s  trials  in  the  positive  domain,  all  of  which  give 
successes,  equation  (4.1)  will  indicate  inconsistency.  Alternatively,  if 
there  are  f  trials  in  the  negative  domain  all  of  which  give  failures, 
equation  (4.1)  would  lead  to  inconsistency.  In  each  of  such  cases,  there 
will  be  no  "zone  of  mixed  results.” 

D.  On  the  Need  for  a  Zone  of  Mixed  Results 

Suppose  there  are  three  observations  in  the  positive  domain  given  by 
tl  <  x2  <  t3*  an<*  t^iat  failure  occurs  at  tj  and  successes  at  t2  and  t3> 

This  provides  an  illustration  of  a  zone  of  "no  mixed  results."  In  this 

situation,  equation  (4.1)  would  reduce  to  z(t^)/q(t^)  =  z(t2)/p(t2)  + 

z(tj)/p(tj).  It  will  be  evident  from  an  examination  of  the  relative 

magnitudes  of  z/q  and  z/p  and  their  graphs  (Diagram  I)  that  such  an 

equation  can  be  consistent,  although  t^,  t2  and  t^  cannot  take  values 

far  to  the  right.  But,  equation  (4.2)  would  reduce  to  tjZ(t j)/q(tj)  = 

t2z (t2)/p(t2)  +  tjZ (t j)/p(t j) .  Such  an  equation  is  inconsistent  with 
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equation  (4.1),  since  (t^/qft^  =  t^ft^/pCtj)  +  t^  (tj) /p  (tj) 
t2z(t2)/p(t2)  +  t3z(t3)/p(t3) . 

If,  on  the  other  hand,  we  had  successes  at  t^  and  t3>  and  failure  at 
t^,  then  equation  (4.1)  would  reduce  to  z(t2)/q(t2)  =  z(t^)/p(t1)  + 
2(t3)/p(t3).  It  will  be  evident  from  Diagram  1  that  such  a  possibility 
exists,  although  not  in  a  zone  very  far  away  from  0.  Since  t2  lies 
between  t^  and  t3,  equation  (4.2)  will  also  be  consistent.  This  illustrates 
a  zone  of  mixed  results. 

If  h  giving  a  failure  lies  in  the  negative  domain  of  t,  that  is, 

if  tj  is  negative,  while  t2  and  t3  are  in  the  positive  domain  giving 

success,  then  equation  (4.1)  would  reduce  to  z^(t^)/p(t^)  =  z(t2)/p(t0) 

+  z(t.)/p(t3),  which  may  not  be  inconsistent.  Equation  (4.2)  would 

reduce  to  -  tJz(t1)/p(t1)  =  t2z (t2)/p(t2)  +  t3z  (tj)/p(t3) .  This  is 

impossible,  since  a  positive  quantity  cannot  be  equal  to  a  negative 
quantity.  This  is  an  illustration  of  a  zone  without  "mixed  results," 
leading  to  inconsistency. 

We  can  extend  the  above  reasoning  with  reference  to  observations  in 
the  negative  domain  to  conclude  in  general  that  there  will  be  inconsistency 
in  the  equations  arising  from  a  zone  of  "no  mixed  results." 

A  scrutiny  of  the  relative  magnitudes  of  z/q  and  z/p  as  reflected 
in  the  graphical  portrayal  in  Diagram  I  would  reveal  another  important 
limitation  of  this  estimation  procedure.  Observations  few  in  number, 
but  too  far  away  from  t  =  0,  that  is,  too  far  away  from  y  solely  in  the 
positive  domain  of  t,  or  solely  in  the  negative  domain  of  t,  in  spite  of 
giving  a  mixture,  may  tend  to  make  equation  (4.1)  inconsistent.  Such  a 
contingency  will,  of  course,  be  very  unlikely  from  the  practical  point 
of  view. 


VI.  CONSISTENCY  OF  THE  EQUATIONS  RELATIVE  TO  THE  ESTIMATION  OF  o 

The  following  observations  on  the  relative  magnitudes  of  the  com¬ 
ponents  of  equation  (4.2)  which  is  relevant  to  the  estimation  of  o  will 
indicate  how  the  consistency  of  the  equation  may  be  affected  by  a  small 
sample.  Diagram  II  shows  the  graphs  of  tz/q  and  tz/p.  It  is  easy  to  see 
that 
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The  derivative  of  tz/p  is  given  by 

‘  |  (t2  *  H  .  i).  (8) 

P  P 

u 

At  t  =  0,  the  derivative  (8)  is  positive  being  given  by  (2/tt)  .  The 
function  tz/p  is  increasing  to  the  right  of  0,  and  has  a  maximum  at  a 

2 

point  t  satisfying  (t  +  tz/p  -  1)  =  0.  The  solution  of  the  equation  is 
obtained  empirically  at  t  =  0.84  where  the  value  of  the  function  tz/p  is 
0.29. 


Similarly,  the  derivative  of  tz/q  is  also  positive  at  t  =  0  being 

given  by  (2/ir)  .  This  function  goes  to  °=  with  t,  and  does  not  appear  to 
have  a  maximum. 

The  value  of  tz/q  at  t  =  0.84  is  1.18,  while  at  this  point, 
tz/p  =  0.29,  being  the  maximum  in  this  neighborhocd .  On  the  right-hand 
side  of  t  =  0.84,  the  value  of  tz/p  goes  on  decreasing  while  that  of 
tz/q  goes  on  increasing.  It  will  thus  be  evident  from  an  inspection  of 
the  graphs  in  Diagram  II  that  if  observations  are  taken  solely  in  the 
positive  domain,  such  observations  cannot  be  taken  very  far  away  from 
t  =  0.84  to  the  right  to  ensure  that  equation  ( 4.2 )  be  consistent.  Also, 
a  failure  has  to  be  compensated  by  more  than  one  success.  With  an  inter¬ 
change  of  p  and  q,  a  similar  situation  will  emerge  if  observations  are 
taken  solely  in  the  negative  domain  of  t. 

Thus,  Diagrams  I  and  II  jointly  reveal  that,  whether  in  the  positive 
or  in  the  negative  domain,  observations  cannot  be  taken  too  far  away 
from  zero. 


VII.  ON  THE  DIVERGENCE  (z/q  -  z/p) 

Along  with  the  graphs  of  z/q  and  z/p,  the  graph  of  (z/q  -  z/p),  as 
mentioned  in  section  V,  is  also  shown  in  Diagram  1,  as  this  difference 
was  vital  in  checking  the  consistency  of  equation  (4.1)  in  a  small  sample. 
This  divergence  serves  yet  one  more  useful  purpose  ror  which  the  following 
algebraic  development  is  necessar) . 

It  will  be  noticed  that  (z/q  -  z/p)  ■*  «,  meeting  z/q  asymptotically 
as  t  -*■  «.  The  graph  of  (z/q  -  z/p)  might  give  the  impression  as  if  it  is 
a  straight  line.  It  is  not  so.  The  slope  of  the  curve  changes  very 
slowly. 
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Let  C(t)  *  (z/q  -  z/p) .  Its  first  derivative  is  given  by 

C’Ct)  -  C|)2  ♦  (f)2  •  (9) 

It  is  shown  in  Appendix  (A)  that  C'(t)  >  0, indicating  that  C(t)  is 
increasing  to  the  right  of  the  origin.  The  value  of  (9)  at  t  =  0  is 
positive  being  given  by  4/ir. 

It  can  also  be  easily  shown  that 


lim  Cm  4  =  l2725 

t-K)  t  IT 


The  equation  of  the  tangent  to  C(t)  at  t  =  0  can  therefore  be  written  as 


C(t)  =  tan  0  t  , 


where  tan  6  =  1.2733,  with  0  =  51.85°. 

We  show  in  Appendix  (B)  (writing  B(t)  for  f(t)/t)  that 

lim  =  l  .  (12) 

Again,  we  show  in  Appendix  (C)  (writing  tB(t)  for  C(t))  that 

lim  C'(t)  =  1. 
t» 

Hence,  we  may  say  that  the  tangent  at  t  «  makes  an  angle  of  45°  with 
the  t-axis  and  that  equation  (12)  represents  the  limiting  form  of  the 
tangent  at  infinity. 

It  is  thus  observed  that  the  slope  starts  with  an  angle  of  51.85° 
at  0,  and  attains  an  angle  of  45°  at  infinity.  What  happens  in  between 
is  indicated  below. 


The  second  derivative  of  C(t)  is  given  by 

c(t)  -2(4-4>  -  > 

q  p 

-St(4+4)  (13) 

q  p  M 


C"(t)  is  0  at  t  =  0,  and  then  becomes  negative  until  it  goes  to 
zero  again  at  t  =  3.0731,  indicating  that  it  is  a  point  of  inflexion  of 
C(t).  The  values  of  C(t)  and  C’(t)  at  t  =  3.0731  are  obtained  respectively 
as  3.3476  and  0.9426.  Thus,  the  equation  of  the  tangent  to  C(t)  at 
(3.0731,  3.3476)  may  be  obtained  as 


-  -  -  =  0. 9426t  +  0.4509.  (14) 

q  p 

We  can  find  the  equations  of  the  tangents  also  at  other  points.  Such 
equations  will  be  helpful  in  checking  the  solutions  of  some  equations 
for  which  closed  form  solutions  are  not  available  (see  section  VIII). 

The  graph  of  C"(t)  has  some  interesting  features.  It  is  zero  at 
t  =  0,  and  then  it  becomes  negative  and  remains  so  until  it  becomes  zero 
again  at  t  =  3.0731.  From  this  point  on,  it  remains  positive,  going  to 
zero  again  in  the  neighborhood  of  t  =  6.287.  Although  we  do  not  need  to 
know  about  its  behavior  beyond  t  =  3.0731  for  the  present  purpose,  the 
nature  of  the  graph  beyond  this  point  may  be  interesting  from  the  mathematical 
point  of  view. 


VIII.  ON  THE  EFFICIENCY  OF  ESTIMATION  WHEN  THE  EQUATIONS  ARE  CONSISTENT 

A.  The  Basis  for  the  Estimation  of  Efficiency 

The  elements  of  the  information  matrix  A,  given  in  (7.1)  -  (7.3), 
are  rewritten  below  as; 


a  =  -L-  y  — —  =  -L-  i  A(t . ) , 
uu  2  v  p.  q.  2  v  l  ’ 

o  1  *1^1  o  1 


(15.D 


A  =  :r 

uo  L 


l  z-  t. 

-  I  5“  =  -  l  A!,i>v 

a  i  *iHi  o  l  x  x 


(15.2) 
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=  V  i 


2  2 
z.  t. 

1  i 


T  /-  - ~7  >. 

a  ri^i  o  l 


T"  l  A(ti)ti2* 


(15.3) 


where  A(t^)  is  defined  to  be  /'p^q^. 

In  a  large  sample,  we  would  expect  A^o  to  be  0.  Since  A(t^)  is 

positive,  and  that  A(t^)  *  A(-t^),  A^o  would  tend  to  0  with  a  large  set 

of  ±  t.  A  small  sample  also  could  be  taken,  if  observations  could  be 
controlled,  in  such  a  manner  as  to  make  zero,  or  nearly  zero.  In 

fact,  ve  would  take  the  observations,  if  we  could,  at  such  values  of  t 

as  would  make  A  zero,  and,  at  the  same  time,  make  det.  (A)  maximum,  or 

det.  (A1)  minimum,  in  order  that  the  efficiency  of  estimation  be  maxi¬ 
mized.  This  desideratum  of  D-optimality  is  in  keeping  with  the  criterion 
as  is  sometimes  adopted  in  Design  of  Experiments  to  define  the  overall 
efficiency  of  an  experiment.  Such  maximization  of  det  (A)  or  minimization 

of  dot.  (A  )  would  bring  about  an  overall  reduction  of  variances  of  the 
estimates,  although  we  would  have  liked  that  the  variance  of  each  of  the 
estimates  be  minimized  individually,  if  possible. 

Det.  (A)  is  given  by 


Det.  (A)  =  (A  A 

1  ;tyj  oo 


<V  ! 


Expression  (16)  will  be  maximized  if  (A  A  )  is  maximized,  and 
2  2  uv*  0c> 

(Ap<p  *s  miniraize(l'  Since  (A  j  is  non-negative,  the  minimum  possible 

value  would  be  given  by  0,  and  this  would  be  consistent  with  the  require- 

2 

ment  that  the  estimates  of  u  and  o  be  uncorrelated.  Before  the  problem 
of  maximizing  (AyyAaa)  is  taken  up,  it  may  be  appropriate  to  examine 
the  problem  of  maximizing  each  of  A  and  A^  separately. 

If  it  was  desired  to  estimate  only  u,  we  would  have  utilized  only 
one  equation,  equation  (4.1),  for  this  purpose,  and,  in  that  case,  the 
'  -1 

variance  of  p  would  have  been  given  by  (Ayy)  •  Similarly,  if  it  was 

required  to  estimate  only  o,  we  would  have  made  use  of  equation  (4.2), 

- 

and  the  variance  of  o  would  have  been  given  by  (Aoa)  •  Therefore,  in 

these  two  separate  problems,  we  would  have  maximized  either  A  or  A 

pu  oo 
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8.  Maximum  of  A 
_ mu 

Since  =  £A(t^)/o^,  and  each  A(t.)  is  positive,  the  maximum 

of  A  will  be  attained  if  each  Aft.)  is  maximized.  This  condition 
mw  v  i 

will  be  consistent  with  the  admissibility  of  replications  at  a  t^. 
A(t)  is  given  by 

A<t)  =  ^  •  Cl7) 


The  graph  of  (17)  as  a  function  of  t  in  the  positive  domain  of  t  is 
given  in  Diagram  III.  The  graph  shows  that  (17)  has  a  maximum  at  t  =  0. 
Differentiating  (17)  with  respect  to  t,  we  obtain 


A'(t)  =  -  —  (2t  +  -  -  -). 
pq  p  q 


An  obvious  solution  of  the  equation  2t  +  z/p  -  z/q  =  0  is  given  by  t  =  0, 
and,  at  this  point,  the  value  of  A(t)  is  2/ir.  In  view  of  what  has  been 
noted  in  section  VII,  the  equation  (18)  cannot  have  any  other  solution 
since  (z/q  -  z/p)  can  hardly  be  as  large  as  2t.  The  maximum  possible 
value  of  A  ,  admitting  repetitions  at  the  same  value  of  t,  will  be 
given  by 


(19) 


where  n  is  the  size  of  the  sample.  Hence,  the  minimum  possible  variance 
for  u  will  be  given  by 


(20) 


It  may  be  difficult  to  attain  this  ideal  minimum  limit  in  actual  practice. 
But,  observations  close  to  t  =  0,  that  is,  close  to  the  mean,  and  approach¬ 
ing  the  mean  from  both  sides,  would  bring  down  the  variance  close  to  it. 

We  may  approach  the  mean  from  both  sides  because  the  value  of  (17) 
remains  the  same  for  ±  t. 
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Diagram 


C.  Maximum  of  A 

_ CO 

2  2 

Since  A  =  yA(t.)t.  /a  ,  and  each  of  the  constituent  terms  is 
a  a  £  1  1 

1  2 

positive,  Aoo  will  be  maximized,  if  each  (A^  )  is  maximized.  We  have 


A(t)t2 


2J2 

z  t 

pq 


(21) 


The  graph  of  (21)  as  a  function  of  t  in  the  positive  domain  of  t  is  indi¬ 
cated  in  Diagram  (III).  It  would  show  that  (21)  has  a  maximum.  The 

2 

derivative  of  (A(t)t  )  is  given  by 

2 

(A(t)tV  =  -  ^  [2t2  -  t(|  -  p  -  2]  .  (22) 


The  maximum  of  (21 )  will  be  obtained  at  a  point  t  given  by  the 
solution  of  the  equation. 


2t2  -  t(^  -  |)  -  2  =  °  .  (23) 

The  solution  t  =  0  coming  from  (22)  is  ruled  out.  The  zero  of  the  function 
in  the  positive  domain,  found  empirically,  is  given  by  t  =  1.5750,  and 
2 

the  value  of  (A(t)t  )  at  this  point  is  given  by  1.9114/tt.  Hence,  the 
maximum  possible  value  of  Aaa  will  be  given  by 

Sa-Mil)  ,  (24) 

Tto 

where  n  is  the  size  of  the  sample.  Thus,  the  minimum  possible  variance 
of  o  may  be  obtained  as 


2 

no 

n(l . 9114)  • 


(25) 


Here  also,  observations  close  to  t  =  i  1.5750,  that  is, 
+  1.5750a  and  -  1.5750o  from  the  mean,  would  bring  down 
close  to  what  is  shown  in  (25).  The  minus  sign  of  t  is 
the  value  of  (21)  remains  the  same  for  ±  t. 


close  to 
the  variance 
admitted  because 
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The  solution  of  equation  (23),  as  mentioned  above,  has  been  obtained 
empirically.  However,  the  solution  may  be  checked  with  the  help  of  the 
results  indicated  in  section  VII.  The  equation  of  the  tangent  to  C(t) 
at  a  point  (t^,  C(tQ))  may  be  obtained  as  C(t)  =  C '  Ct Q) t  +  C(t^)  -  C'(tg)t0. 

In  this  particular  instance,  t^  =  1.S7S0,  C(tg)  =  1.8802,  and  C'Uq)  = 

1.0644.  Hence,  for  (z/q  -  z/p)  in  equation  (23),  we  may  substitute 
(1.0644t  +  .2037).  With  this  substitution,  the  equation  (23)  will 
reduce  to 

t2  -  . 2177t  -  2.1377  =  0  .  (26) 

The  positive  root  of  equation  (26)  is  obtained  as  1.5750.  This  root 
checks  the  solution  obtained  empirically. 

D.  Maximum  of  (A  A  ) 

_ MM  OQ 

In  the  joint  estimation  of  p  and  o,  we  would  be  interested  in 
maximizing  the  product. 


(A  A  )  =  ^(TA(t.))(TA(t.)t.2) 
MM  CSC  4  V  1  4-  1 


as  (A  )  would  be  expected  to  go  to  zero.  One  way  would  be  to  take 
the  product  of  the  maxima  of  A  and  A^  as  the  required  maximum  of 
(AypAo0) ,  being  given  by 

2n2 (1 . 9114) 

- 2—4 -  >  (28) 

it  a 

where  n  is  the  sample  size.  It  will  hardly  be  possible  to  reach  this 
upper  limit  in  a  problem  of  jointly  estimating  both  p  and  o.  Alternatively, 

2 

a  less  ambitious  maximum  may  be  provided  by  n  times  the  maximum  of  the 
product , 

[A(t)  x  A(t)t2]  =  [A(t)t]2.  (29) 

The  graph  of  (29)  as  a  function  of  t  in  the  positive  domain  is  indicated 
in  Diagram  (III).  The  function  appears  to  have  a  maximum.  The  derivative 
2 

of  (A(t)t)  is  given  by 


[(A(t)tn 


[2t2 

(Pq) 


t  (-  -  -) 

q  p 


In  the  context  of  finding  the  maximum,  the  root  t  =  0  is  ruled 
out.  The  maximum  will  be  given  by  the  positive  root  of  the  equation 

2t2  "  t(t  ’  f3  ‘  1  *  °*  (31 ) 

The  positive  root  is  empirically  obtained  at  t  »  1.1381.  Hence,  the 
observations  have  to  be  taken  close  to  ±(1.1381o)  from  the  mean.  As  in 
the  two  other  cases,  the  minus  sign  is  taken  as  admissible,  because  (29) 
remains  the  same  for  ±  t. 

Here  also,  the  solution  may  be  checked  by  the  same  device  as  referred 
to  earlier. 

The  value  of  A(t)  at  t  =  1.1381  is  given  by 


1 . 2304 

7T 


0.3916 


(32) 


2 

and  the  value  of  (A(t)t  )  at  t  =  1.1381  is  given  by 


1.5937 

ir 


0.5073 


(33) 


Hence,  the  maximum  of  (A  A  )  by  the  maximum  of  the  product  in  (29) 
is  given  by  vv  00 

r  n(l . 2304)  ,  ..  r  n(1.5937)  , 

l  - 2 -  J  x  l  - 2 —  J 

TTO  TT0 

2 

«  (1.9609) 

TI  O 
2 

=  (.1987)  .  (34) 

o 

In  this  set-up,  when  A  =0,  the  minimum  possible  variances  of 
u  and  o  would  be  given  respectively  by 


2 

•no 

n(l . 2304) 


and 


2 

•nq 

n  (1.5937)  ’ 


(35) 
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that  is,  by 


! 


h. 

V 


L 

l 

fe 

f 


t 


2  2 

^■(2.5530)  and  ^-(1.9712)  .  (36) 


E.  Recommendation 


Hence,  in  the  problem  of  joint  estimation  of  y  and  o  with  maximized 
efficiency  for  both,  the  recommended  procedure  would  be  to  aim  at  taking 
the  observations  in  the  neighborhood  of  t  =  ±  1.1381,  that  is,  at  a  dis¬ 
tance  of  (±  1.1381)0  from  the  mean.  If  we  take  the  same  number  of  obser¬ 
vations  on  either  side  of  the  mean  at  the  same  distance,  A  will  be  zero 

no 

In  this  context,  it  may  be  pertinent  to  observe  that  the  point  t  =  0 

2 

makes  A(t)  the  maximum,  but  contributes  nothing  to  A(t)t  .  On  the  other 

2 

hand,  the  points  at  t  =  ±  1.5750  make  A(t)t  the  maximum,  while  adding 
something  positive,  although  not  very  large,  to  A(t). 

In  spite  of  the  fact  that  observations  in  the  neighborhood  of  t  =  0 
would  result  in  increased  efficiency  in  the  estimation  of  y,  while  not 
being  contributory  at  all  to  the  estimation  of  a,  aiming  at  t  =  0  might 
help  in  getting  a  zone  of  mixed  results. 


IX.  ASSESSMENT  OF  EFFICIENCY  OF  A  RECORDED  EXPERIMENT 

Golub  and  Grubbs1  used  this  methodology  to  provide  the  estimates  of 

2  2 

y  and  a,  as  well  as  the  estimates  of  o^  and  a a  for  the  following  experiment. 

In  firing  five  rounds  of  a  given  projectile  of  a  given  armor  plate, 
the  following  observations  were  recorded: 


Velocity  (f/s) 
2433 
2415 
2415 
2453 
2423 


Condition  of  Impact 
Non-penetration 
Non-penetration 
Non -penetration 
Penetration 
Penetration 
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In  the  above  example,  the  sample  size  n  *  S.  The  estimates  of 

*  *  1 

p  and  a  were  obtained  respectively  as  2431.6  f/s  and  15.0  f/s,  and  of 

?  7 

o'  and  o'  as  115. S  and  155.5,  respectively.  The  variance-covariance  matrix 
was  obtained  as 


r  -i 

r  1 

.01011 

-  .00330 

*  X 

115. S 

50.8 

-  .00330 

.00751 

50.8 

155. 5J 

The  value  of  det  (A)  =  .000065036,  while  the  value  of  (34),  substituting 
«2  2 

o  for  a  ,  is  .00009812.  Hence,  the  ratio  of  the  two  is  .6635  which  means 
that  the  efficiency  of  estimation  in  this  particular  experiment  has  been 
obtained  as  66.35%.  (It  may  be  mentioned  l>ere  that  the  efficiency  could 
also  have  been  calculated  by  dividing  a  /n  (.1987)  by  det  (A  )  to  get 
whole  numbers,  rather  than  fractions,  in  the  division.) 

The  ideal  variance  of  p,  by  (36),  would  have  been  114.89,  while,  in 
2 

the  experiment,  o*  has  been  obtained  as  115.5.  Thus,  the  efficiency  in 

the  estimation  of  p  is  obtained  as  114.89/115.50  =  99.47%  which  is  very 
high. 

The  ideal  variance  of  a,  by  (36),  would  have  been  obtained  as  88.70, 

2 

while,  in  this  particular  experiment,  O'  has  been  obtained  as  155.5.  Thus, 
the  efficiency  in  estimating  o  may  be  calculated  as  88.70/155.5  =  57.04%. 

Now,  let  us  examine  the  values  of  t  of  this  particular  experiment 
which  are  given  below: 


R 

t 

2415 

-1.11 

241S 

-1.11 

2423 

-0.57 

2433 

0.09 

2453 

1.43 

* 


One  may  find  slight  arithmetical  discrepancies  in  these  estimates,  but 
such  small  discrepancies  will  not  be  of  any  material  consequence. 


The  values  of  t  in  the  neighborhood  of  t  *  0  have  been  responsible 
to  render  the  variance  of  y  almost  ideally  small.  If  there  were  a  few 
more  observations  in  the  neighborhood  of  t  =  ±  1.57,  the  variances  of 

c  would  have  been  almost  ideally  smaller. 

2 

An  observation  at  t  =  1.43  has  helped  in  the  reduction  of  o*  to 
some  extent .  u 


If  the  estimated  variances  of  y  and  o  as  obtained  in  this  experiment 
would  have  compared  with  the  more  ambitious  limits  of  variances  as  given 
in  (20)  and  (25),  the  calculated  efficiency  of  this  experiment  would 
be  less. 


fs  X.  REMARK 

e 

1  It  appears  that  many  experiments  have,  in  the  past,  been  analyzed 

|  by  this  procedure  and  we  have,  with  us,  the  estimates  of  y,  o,  and 

•"  2  2 

o*,  o».  It  may  be  worthwhile  to  calculate  the  efficiencies  of  those 

vi  a 

: 

experiments  in  the  light  of  this  discussion  to  let  the  experimenters 
know  where  they  stand  in  retrospect  with  regard  to  the  estimates  of 
y  and  a  in  each  case.  This  search  may  be  of  help  in  the  matter  of  per- 
|  spective  planning. 
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APPENDIX  A* 


We  wish  to  show  that  the  first  derivative  of 


is  positive, 
We  have 


CM  - 


C'(t)  =  -  tl(  i  -  i  )  .  22(  *  *2  > 

HP  q*  p 


z  ,  z  -  qt  ,  z  .  z 
q  1  q  p2  p 

The  function  z  -  qt  is  positive  at  t  =  0,  has  a  negative  derivative,  and 
tends  to  zero  as  t  -»  Therefore,  z  -  qt  >  0  for  t  >  0. 

Hence,  (tB(t))  >  0. 


Appendices  A,  B  and  C  are  due  to  Mr.  W.  Egerland. 
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APPENDIX  B 


Consider  the  function, 


Rri-i  -  f"1  r  z(t) 

B(t)  ’  1  C  qTtT  *  pTtl  } 


We  wish  to  show  that  lim 


B(t)  =  1. 


We  note  first,  since  lim  p(t)  =  1,  that 

lim  B(t)  =  lim  ^y 

t-*»  t-H»  ^  ; 


Since  the  limits  of  both  z(t)  and  tq(t)  are  zero  as  t  +  «,  we  may  apply 
1 'Hospital's  rule  for  the  evaluation  of  (*)  to  obtain 


lim  B(t)  =  lim 

t-voa  f*00 


Z'  (t) 


The  right  hand  side  of  (**)  is  again  of  the  form  [0/0] ,  so  we  may  apply 

2 

the  rule  again.  Using  z'(t]  =  -  tz(t),  z"(t)  =  (t  -  l)z(t),  we  have 


lim  B(t)  *  lim  - x - 

t-*D  t-*»  -2z(t)  t  z(t) 


=  lim  - 
t-^°  t 


t2  -  1 


=  1  . 


r 
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We  wish  to  show  that  lim  ltB(t))'  *  1. 

t-N» 

Since  (tB(t))'  »  B(t)  +  tB'(t)  =  B(t)  +  tB(t)  log  B(t),  it 

is  sufficient  to  show  that  tB(t)^-  log  B(t)  -*•  0  as  t  -*•  ®.  Writing 
B(t)  in  the  form 


Dfn  _  zCt)(2p(t)-l) 


it  follows  after  some  manipulation  that 


tB(t)  JJ-  log  B(t)  '  t{  |-  *  jElj  -  |1  -  al  .  1  [  B(t) 


■  1  *  I  a-q  “  ~-a  n  BW 


tz 


tpt2FiT  +  e(t)1  B(t)* 


We  can  omit  B(t),  since  B(t)  •+■  1,  as  t  -+■  ®. 


Remembering  that  t  q  and  tz  go  to  0  as  t  +  ®,  we  have,  by  1' Hospital 


rule. 


lim  6(t)  =  lim  2(  1  +  ) . 

t-M»  t-**° 


But,  the  limit  on  the  right  is  zero,  since  from  Appendix  B, 

U„H.  lim  1. 

t-K*>  t-H» 

Hence,  lim  (tB(t))'  *  1. 
t-**> 
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